# ViT Vision Transformer

Screenshots Detection To Classification

A ViT-based image classification model that detects and classifies screenshots.

Image Classification

Pneumonia Model

A ViT-based deep learning model that identifies signs of pneumonia in chest X-ray images.

Image Classification

Facial Age Image Detection

A Vision Transformer (ViT) model trained to predict age ranges from facial images.

Vit Base Patch16 224 In21k Face Recognition

A face recognition model fine-tuned from Google's ViT on an image folder dataset, achieving near-perfect accuracy on the evaluation set.
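Several model names in this list follow the pattern `Patch16 224 In21k`. As a rough illustration of what that encodes, the sketch below parses such a name and works out the token count, assuming the usual ViT convention: square patches of the given size, a square input of the given resolution, one extra classification token, and `in21k` marking ImageNet-21k pre-training. The helper `parse_vit_name` is hypothetical and not part of any library.

```python
import re


def parse_vit_name(name: str) -> dict:
    """Read patch size and input resolution from a name such as
    'Vit Base Patch16 224 In21k' (hypothetical helper, for illustration)."""
    lowered = name.lower()
    match = re.search(r"patch(\d+)[-_ ](\d+)", lowered)
    if match is None:
        raise ValueError(f"no patch size and resolution found in {name!r}")

    patch_size = int(match.group(1))
    image_size = int(match.group(2))
    if image_size % patch_size != 0:
        raise ValueError("image size must be a multiple of the patch size")

    patches_per_side = image_size // patch_size
    num_patches = patches_per_side ** 2
    return {
        "patch_size": patch_size,
        "image_size": image_size,
        "num_patches": num_patches,
        # Assumption: one extra classification token is prepended.
        "sequence_length": num_patches + 1,
        # Assumption: 'in21k' denotes ImageNet-21k pre-training.
        "in21k_pretrained": "in21k" in lowered,
    }


info = parse_vit_name("Vit Base Patch16 224 In21k Face Recognition")
print(info)
```

Under these assumptions a 224×224 input with 16×16 patches is split into a 14×14 grid, so the encoder sees 196 patch tokens plus the classification token.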

Facial Emotions Image Detection

A facial emotion recognition model fine-tuned from Google's ViT-base model, achieving 91% accuracy on the test set.

Medicinal Plants Image Detection

A Vision Transformer (ViT)-based image classification model for Indian medicinal plant leaves, capable of accurately identifying over 50 types of traditional Indian medicinal plants.

Image Classification

Vit Base Patch16 224 In21k Weather Images Classification

A weather image classification model based on the Vision Transformer architecture, fine-tuned on the Kaggle weather dataset and reaching 93.4% accuracy.

Image Classification

Transformers English

Vit Base Patch16 224 Album Vitvmmrdb Make Model Album Pred

An image classification model fine-tuned from Google's ViT model on an unspecified dataset.

Image Classification

Vit Face Expression

A facial emotion recognition model fine-tuned from a Vision Transformer (ViT), classifying faces into 7 expression classes.
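A classifier with 7 output classes typically produces one raw score (logit) per class, which is then turned into a probability and a predicted label. The sketch below shows that last step in plain Python. The label names and logit values are placeholders invented for illustration; the model's actual label set is not given here.

```python
import math


def softmax(logits):
    """Convert raw scores to probabilities (shifted by the max for stability)."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_prediction(logits, labels):
    """Return the most probable label and its probability."""
    if len(logits) != len(labels):
        raise ValueError("expected one logit per label")
    probs = softmax(logits)
    best = max(range(len(probs)), key=probs.__getitem__)
    return labels[best], probs[best]


# Placeholder labels: the real class names are an unknown here.
labels = [f"expression_{i}" for i in range(7)]
# Made-up logits standing in for a model's output on one image.
logits = [0.1, 2.3, -1.0, 0.5, 0.0, 1.2, -0.4]

label, prob = top_prediction(logits, labels)
print(label, round(prob, 3))
```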

Vit Base Patch16 224 Finetuned Imageclassification

An image classification model fine-tuned from Google's ViT model on an image folder dataset, achieving 95.02% accuracy.

Image Classification

Stanford Car Vit Patch16

This is an image classification model based on the Vision Transformer (ViT) architecture, specifically fine-tuned on the Stanford Cars dataset for fine-grained classification of 196 car models.

Image Classification

therealcyberlord

Dog Food Vit Base Patch16 224 In21k

This is an image classification model based on the Vision Transformer (ViT) architecture, specifically designed to distinguish between images of dogs and food.

Image Classification

Rock Challenge ViT Two By Two

This is an image classification model based on the ViT architecture, specifically designed for rock particle classification tasks, achieving an accuracy of 96.6%.

Image Classification

© 2025 AIbase